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What is tlie size of tiie most massive object one expects to find in a survey of a given volume? In 

this paper, we present a solution to this problem using Extreme- Value Statistics, taking into account 

primordial non-Gaussianity and its effects on the abundance and the clustering of rare objects. We 

calculate the probability density function (pdf) of extreme-mass clusters in a survey volume, and 

show how primordial non-Gaussianity shifts the peak of this pdf. We also study the sensitivity of 

>--^ , the extreme-value pdfs to changes in the mass functions, survey volume, redshift coverage and the 

normalization of the matter power spectrum, erg. For 'local' non-Gaussianity parametrized by /nL) 

^ ' our correction for the extreme- value pdf due to the bias is important when /nl > 100, and becomes 

D ^ more significant for wider and deeper surveys. Applying our formalism to the massive high-redshift 

[] . cluster XMMUJ0044. 0-2-33, we find that its existence is consistent with /nl ~ 0, although the 

conclusion is sensitive to the assumed values of /gjjy and erg. We also discuss the convergence of 

the extreme-value distribution to one of the three possible asymptotic forms, and argue that the 

convergence is insensitive to the presence of non-Gaussianity. 
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I. INTRODUCTION 



The statistics of the primordial seeds that grew into the observed large-scale structures holds a wealth of information 

about the physics of the primordial Universe. In the simplest models of inflation, the primordial density fluctuations 

obey an almost Gaussian statistics (see jl| for a review). Tiny deviations from Gaussianity may be quantified, amongst 

^ , other ways^, by the 'local' non-Gaussianity parameter, /nl, defined via the expansion of the non-linear Newtonian 

potential 



$-0 + /nl(0'-(0')) + ..., (1) 

where ^i is a Gaussian random field. This form of non-Gaussianity arises in simple models of single and multi-field 
inflation [2,SIJ1 as well as some curvaton models [a,SI3- Observational constraints on /nl from the cosmic microwave 

l/~j background (CMB) anisotropics are currently consistent with /nl = 32±42 (2(t) [g]. However, if /nl is in fact much 

, [ \ smaller, its effects on the CMB would be difficult to extract and distinguished from non-Gaussianity arising from 

r-^ ■ secondary sources such as gravitational lensing and instrumental noise [9, 10]. 

The statistics of large-scale structures offers a complementary probe of non-Gaussianity on much smaller scales 
than the CMB. Indeed, it is possible that /nl measured on Gpc scales may be quite different from that measured on 
Mpc scales. In wavenumber space, this translates to a possible fc-dependence of /nl, which have been hinted at by the 
numerous observations of massive high-redshift clusters [111, [l3: E, H, 13 • These massive clusters exist, according to 

y^ ■ some, in greater abundances than expected from a Gaussian statistics. Some authors have concluded that the level of 
H non-Gaussianity on Mpc scale required to explain the existence of certain rare clusters is /nl = a few xlO^ [la, [l7|. 

- - 1 In contrast, some have argued that these claims are based on misinterpretation of data, and that the occurrences of 
these rare objects are in fact consistent with a Gaussian statistics [isl. [l9l. [2q| . 

In this work, we offer our opinion on this debate by presenting an approach to calculating the probability of observing 
rare objects based on extreme-value statistics. We begin by asking: what is the probability distribution of the most 
massive clusters found within a given volume at a given redshift range? Our technique relies on a basic application of 
the so-called void probability distribution introduced by White [2l| . Thi s ap proach was successfully used to study the 
abundances of massive clusters given a Gaussian initial condition in [22l [23| . In this work, we extend the groundwork 
laid by these authors to study the effect of /nl on the distribution of extreme-mass objects. For other cosmological 
applications of extreme- value theory, see [H, [^ [H, [13, [H, [H, [s^, HH. 
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^ In general, a non-Gaussian pdf can have divergent moments (e.g. the Cauchy distribution). In this work we assume that the primordial 
density distribution has finite moments up to third order (i.e. finite skewness). 



Previous approaches to extreme-value statistics of clusters have so far either neglected the clustering, or bias, of 
galaxy clusters [3l|, [S^], or considered it in the Gaussian case [23, [23]- In this work, however, we have included the 
effects of the bias in the presence of non-Gaussianity. Whilst Davis et al. [22] have previously reported that the effects 
of the bias on the extreme- value distribution are small in the Gaussian case, it remains to be shown if this also holds 
in the presence of non-Gaussianity, which can introduce a stroiig scale dependence in the bias ^^. We investigate 
this problem in this work using the formalism of Valageas [SJ, |35|, who showed how the bias can be calculated in 
real space for a given /nl- As we shall see later, the contribution from the bias can indeed be significant if /nl is 
sufficiently large. 



II. THE PRIMORDIAL DENSITY FLUCTUATIONS 

In this section, we introduce the parameters needed to describe the primordial density fluctuations statistically. 
Some of our present conventions (such as the Fourier transform and the moments of the density fluctuations) slightly 
differ from our earlier work [361 137|. In particular, smoothing by a window function will be kept explicit, in contrast 
with other work in which overdensities are defined to be implicitly smoothed. 

Let pc, Pb, Pr, Pa be the time-dependent energy densities of cold dark matter, baryons, radiation and dark energy. 
Let Pm = Pc + Pb- We define the density parameter for species i as 



n,. 



P^iz = 0) 
Pcrit 



(2) 



where pcrit is the critical density defined as pcrit = 3Hq/8ttG. The Hubble constant, Hq, is parametrized by h via 
the usual formula Hq = lOOh km s^^Mpc^^. Results from a range of astrophysical observations are consistent with 
h ~ 0.7, flc ~ 0.23, rib ^ 0.046, fir ~ 8.6 x lO^^ and VIa ^ I - flm - ^r (see e.g. [i,|3|). 
The density fluctuation field, S, is defined at redshift z as 



(5(x,z) 



Pm(x, z) - {pmjz)) 
{Pm{z)} 



(3) 



where (pm) is the mean matter energy density. To make the notation less cumbersome, we shall write (5(x) to mean 
(5(x, z). The Fourier decomposition of 6{x) is given by 



5(x)= /"dk(5(k)e*''". 



(4) 



The gravitational Newtonian potential, $, is related to the density fluctuation by the cosmological Poisson equation 



5{k) 
A{k,z) 



-4(fc,z)$(k), 






TikEH)D{z), 



(5) 
(6) 



where T is the transfer function and D is the linear growth factor calculated using the fitting formula of [33 . |40| with 
D{Q) w 0.76. We follow the approach outlined in [41] and use the transfer function 



T(fc) 



ln[l 4- (0.124fc)2] 
(0.124fc)2 



1 + (1.257fc)2 + (0.4452/c)4 + (0.2197/fc)^ 
1 + (1.606/c)2 + (G.8568/c)4 + (G.3927/c)6 



1/2 



(7) 



In addition, we also incorporate the baryonic correction of Eisenstcin and Hu [4^ , whereby the transfer function is 
evaluated at 



«EH 



kn 



1/2 



Unil 



O^'m 



1 — a 



1 + {O.ASksY 



(8) 



with 



a = l-0.3281n(431fi™/i2)— ^ + 0.381n(22.317™/i2) (-A 



and 



44.51n(9.83/r!„/i2) 
s = — , Mpc. 

The power spectrum, P{k), can be defined via the two-point correlation function in Fourier space as 

((5(ki),5(k2))=5z5(ki+k2)P(fc), 



(9) 



where Sd is the S-dimensional Dirac delta function. In linear perturbation theory, it is usually assumed that inflation 
laid down a primordial spectrum of the form P{k) oc k"% where Ug is the scalar spectral index (assumed to be 0.96 
in this work). 

The variance of linear density fluctuations smoothed on scale R is given by 



P°° dk 
CT|j = 47r / —W^{kR)V{k). 
Jo k 



(10) 



where P(fc) = k^P{k) ex [Aik,z)]'^ (/fc/ffo)"^"^ . 

We choose W to be the spherical top-hat function of radius R. In Fourier space, we have 

„..,m o r sin(fci^) cosjkRy 

^(^^) - ^ [-JkRyr - -JkR^ 

The mass, M, of matter enclosed by a top-hat window of radius R is given by 



4 c, ,^ f R 

M = -nR^p,n ~ 1.16 X 10i2j7„, ^ .^^ 
3 \h ^Mpc 



h-^Ma 



(11) 



fl2) 



With the above relation, the smoothed variance, a^, can be equivalently expressed as ctm- Finally, the normalization 
of V{k) is such that 



as = (J{R = S/i^^Mpc, z = 0) = 0.801. 



(13) 



III. CLUSTER NUMBER COUNTS 



The mean number density, n, of objects with mass greater than to, at redshift z can be calculated by 

/■°° dn 
ni>m,z)^l —dM, (14) 

where dn/dM is the differential number density. In the presence of local non-Gaussianity, Matarrese, Verde and 
Jimenez [4^ used a saddle-point expansion (assuming that the deviation from Gaussianity is sufficiently weak for 
such an expansion to converge) to derive a correction factor for dn/dM of the form 



TZ — exp 






r A2 



dS, 



6 A d In cta/ 



A 



A = 



ScS3 



(15) 
(16) 



where the third cumulant, S'3, is given by S3 = (S^), and is assumed to be almost independent of the smoothing 
mass-scale, M. This latter assumption indeed holds on cluster scales (see ((T8)) below). S'3 can be calculated either 
from a 3-dimensional integral 



S^iM) 



GfR-RRiO) 



'M 



with fn-BR defined in equation p4[) . or from the fitting formula [3£ 



SsiM) 



3.15 X 10~Vi 



NL 



-.0.838 



(17) 



(18) 



The critical overdensity, Sc, is taken to be^ 



Sc ~ \fa X 1.686, 



(19) 



with the 'fudge factor' ^/a — 0.9 as recommended by [45|, although there is still debate over its value [44 l46l| . 

There are alternative forms of the correction factor, 7?,, given by LoVerde et al. [47| based on a low-order Edgeworth 
expansion, and by Paranjape et al. (481] based on resumming terms in the saddle-point expansion of the mass function. 
We tested both of these alternative corrections and found that, in the range of parameters used in this paper, there 
are only small differences between the various prescriptions and our main results are unaffected by the choice of the 
correction factor. In the rest of this work, we shall use only the MVJ correction factor (see j45ij for a comparison 
between the correction factors). 

In summary, we shall consider the non-Gaussian differential abundance of the form 



where v = 5c/ <jm and F{v) is one of the following three standard mass-functions 



(20) 



Press-Schechter [48 

Sheth-Tormen [sC 
Tinker et al. [HI, [52 



F 



PS 



F, 



ve 



^z.V2 



ST — 0.322a/ — i^exp 

TT 



av 



^Tinker — 0.368 



1 I / 2\-0-3 

1 + [av ) 



^2n+l^~-,u^/2 



a = 0.707, 



13 = 0.589(1 + z)°-2, (j) = -0.729(1 + z)-•'■°^ 
r/ = -0.243(1 + z)"-27,7 = 0.864(1 + z)""-"^ 



(21) 

(22) 
(23) 



The Press-Schechter and Sheth-Tormcn mass functions can be derived by considering the overdensity, S, as a stochastic 
function of the smoothing mass scale, M, and associating trajectories (in the (M, 5) plane) that overshoot a barrier, 
6c, with a collapsed object. One can show that a spherical collapse can be associated with a barrier of constant 
height, resulting in the Press-Schechter mass function [40|, whilst an ellipsoidal collapse can be associated with a 
drifting barrier, Sc = 1.686(1 + a{S/a)") (a,/3 constant), giving the Sheth-Tormen mass function (50|. The Tinker 
mass function belongs to a family of so-called universal mass functions derived from a suite of iV-body simulations, 
with the functional form deviating from simulation results by < 5% in the redshift range considered here (for detail 
see [Hill, and also 11 mill)- 

The number of objects with mass above M expected at redshift z is given by the integral 



dV_ 
dz 



M 



dn 

— — dm, 

dm 



where the volume element dV/dz satisfies 



dV_ 

dz 



f. 



An 



sky" 



dz' 



-'H{z) yjo H{z'), 

H{z) « Ho[n^{i + zf + ns\"\, 



(24) 

(25) 
(26) 



and /sky is the fraction of the sky covered by the survey. The number count for z — 1, /sky = 1 and /nl = or 
100 is shown in figure [T] Comparing the mass functions, we see that the Sheth-Tormen gives the highest number 
count, followed by the Tinker and the Press-Schechter mass functions. Changing /nl to 100 (right panel) increases 
the number count at the high-mass end by roughly an order of magnitude. See e.g. [46l |56| for more comparisons 
between various mass functions. 



^ Note that &c is taken to be constant, whilst the redshift dependence is carried by the factor A(k, z). This convention agrees with |34|, |35 
but is different from the "excursion-set" convention in which the redshift dependence is carried by Sc, with cr extrapolated to z = (e.g 
[44l ). We believe our present convention will facilitate comparison with 1341. |35|| . whose results will be used in the next section. 
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FIG. 1: The number of objects above mass M with /nl ~ (left) and 100 (right) at z — 1, calculated over the full sky using 
the Press-Schechter (P-S), Sheth-Tormen (S-T) and Tinker et al. mass functions. With /nl ~ 100, the number count 
increases by roughly an order of magnitude at the high-mass end compared with /nl ~ 0. 



IV. BIAS 



In the seminal work of Dalai et al. [331] . it was shown quantitatively how non-Gaussianity gives rise to characteristic 
changes in the clustering of density peaks corresponding to rare objects. At leading order, it is common to define the 
bias in Fourier space as the ratio of the power-spectra 



b\k) 



fhalo(fc) 



(27) 



which represents the amplitude at which density peaks (Phaio) trace the underlying dark matter distribution {Pm)- 
The Fourier space formalism was used by a majority of papers on non-Gaussian bias {e.g. [4^ |53, lla IIE l60|). 

However, an arguably more intuitive measure of the bias is in real space, where the density fluctuation in peaks 
(i.e. luminous objects) is expressed as a non-linear function of the local dark- matter density fluctuation. On linear 
scales, the bias is given by the ratio of the correlation functions [61,, i62|] 



h'ir) = 



epk(r) 
CM ' 



(28) 



where r is the comoving length in Eulerian space (throughout this work quantities with a subscript 'pk' are associated 
with density peaks). The correlation function, S,, is defined as 



e(xi,X2) = (<5(xi),^(x2)), 



Xi -X2 



On linear scales where Vik) is of a power-law form parametrized by ng, we can write 



^(r) = 47r 



dk 



rik)jo{kr), 



(29) 



(30) 



where jo{x) = sinx/a;. The real-space bias tells us directly about the clustering amplitude of density peaks separated 
by distance r. We shall refer to r as the separation length. 

Unfortunately, when comparing (P7|) and (pS)) . we see that the real-space bias, b{r), and the Fourier-space bias, 
b{k), are not simply related via a Fourier transform but rather a complicated convolution. In |37| . we avoided this 
problem by interpreting (j28p as a ratio of joint probabilities of finding overdensities at two points distance r apart, 
and then applying a bivariate Edgeworth expansion. Due to the algebraic nature of the Edgeworth expansion, this 
technique was readily applied to non-Gaussianity parametrized by the cubic order parameter, ^nl, but surprisingly 
the application is much less straightforward for /nl- 

An alternative method for calculating the real-space bias in the presence of /nl was presented by Valageas [SJ, |33 
in which he showed that analytic calculations could be made as long as the separation length is sufficiently large. In 
this work, we shall follow this formalism, of which we give a simplified account here. 



A crucial element in the real-space approach is the mapping between the separation length, s, in Lagrangian 
coordinates (associated with linear density fluctuations) and that in Eulerian coordinates (associated with non-linear 
fluctuations). This relation is given by 



s ~ r 1 



2SRir) 



(31) 



accurate at large distances where Sji{r) <^ 1. Here dnir) can be interpreted as the radial profile of the linear density 
contrast from the centre of the halo. The profile is given by 



5c 5"^- 



fo;RR{r) + 2gR.oji{r) - 3 



o'fl,o(^) 



fR:RR{0) 



(32) 



In this expression, the functions an-^^Rr^ir), fR-,RiR^{r) and gR-Rj^R2ir) are defined by the following integrals^ 

P°° dk 
4,.ii.W = 4^y^ —V{k)W{kR,)W{kR2)j^{kT), 

fR:R^RAr) = 8^2^(0)/nl r ^V{k,)W{k,R,) r ^V{k2)W{k2R2) f df,W{kR)—^ 

Jo ki Jo fc2 J-1 A(ki 



(33) 



)A{h 



-jo(fcr-), (34) 



gR;R^RAr) = 8^'i?(0)/NL / ^V{k^)W{k^R^)Jo{k^r) / -lv{k2)W{k2R2) / diiW{kR)-— 

Jo ki Jo fc2 7-1 A[ki 



A{k) 



)Aik2) 



m 



where /i is the cosine of the angle between ki and k2 , and k — \/k\ + A:| -(- 1k\k2[i.. 

With these definitions, Valageas showed via a saddle-point expansion that the bias for objects mass M is given by 



}?{M,r) 



<o(^) 



(1 + 8R{s))e^'^^ 



where 



A(s) 



<R{'n 



261 



u = cr% + <7J^,R{s). 



fR,RR{s) + 2<?fl,;flfl(s) + 1 - ^ fR,RR{0) 



(36) 

(37) 
(38) 



Figure [21 shows the real space bias for a range of M and r. Keeping r fixed and varying M (panel on the left), 
non-Gaussianity shifts b{M) up or down (depending on the sign of /nl)- On the other hand, keeping M fixed and 
varying r, we see how nonzero /nl introduces a scale dependence on 6(r) (6(r) is roughly constant on large scale if 
/nl = 0). This scale-dependence is similar to that seen in [37| for (7nl- 

In the limit of large separation length, the non-Gaussian bias b{r) (with M fixed) follows a simple scaling relation. 
Given /nl, we can define the deviation from the Gaussian bias as A6 = &(/nl) ^ ^(/nl = 0). For r ^ _R, it was 
shown that [34 



A6 ex /nl6(/nl = 0) 



1 2 



h-^Mpc 



(39) 



so that the overall scaling is A5 ^ r^, since 6(/nl — 0) is approximately constant for large r. 

In later work, it will be necessary to define the effective bias associated with a comoving volume. For a spherical 
region of comoving radius L, we define such a bias as 



bL{M) = -L /" dxi /" dx2 b{M, |xi - X2I), , 



(40) 



In this paper /nl is defined in the 'LSS' convention. The 'CMB' convention, as used in |34| . satisfies /iSjj = D{0)fii?° 
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FIG. 2: The effect of /nl on the real-space bias, b, at z = 1. In the panel on the left, the bias is shown as a function of 
smoothing mass-scale, M, with separation length r — 50/i~^Mpc. The other panel shows the effect of varying r with 
M = lQ^*h~^MQ, illustrating the scale-dependence of the bias when /nl ~ ±100. 



It will also be useful to define the effective bias for objects of mass > M 

1 



b{> M, r) = / 6(m, r)-—dm. 

n[> M) J 1^1 dm 

Combining the averaging processes (l40t - (|4T|) . we can define 

/3(i,Af) = 6L(>M), 

which, as shown in the appendix, simplifies in the limit r 3> R{M) to 

p{L,M)^F{L)G{M), 



where 



F{L) 
G{M) 



5 



L 



ni> M) Jm 



/i-iMpc 
&(™, /nl = 



0)- — dm. 
dm 



(41) 

(42) 
(43) 

(44) 
(45) 



An example of the effective bias, /3, with L — YQQh^^Mpc is shown in figure [H in which we set z = 1 and use the 
Tinker mass function for dn/dM, although using a different mass function only results in small differences. Comparing 
this graph with that of b{M) (left panel of figure [5]), we see that whilst [3 retains the overall shape of the curves, it 
is clearly more sensitive to non-Gaussianity. This quantity will be especially useful in the next section in which we 
consider the clustering of massive objects within a specified volume in the presence of non-Gaussianity. 

We conclude this section with a brief comparison between the non-Gaussian imprints in the bias and in the cluster 
counts. Because the non-Gaussian imprint on the clustering of biased objects is significant only on large scales, 
surveys covering a large volume (~ 0(10) Gpc'^) will be required. Preliminary forecasts have shown good prospects of 
achieving /nl ~ C(1) constraints from measurements of the bias with upcoming surveys such as DBS'*, Euclid^ and 
LSST^ [63l |64| . On the other hand, cluster number counts, whilst not requiring a large- volume survey, are almost 
completely insensitive to the shape of non-Gaussianity [43, 163- The bias probes correlation between scales and is 
therefore sensitive to the shape of non-Gaussianity, particularly the local shape, whereas the equilateral shape shows 
up only weakly in the bias 66] . This suggests that a combination of these probes will be required to constrain both 
the amplitude and the shape of non-Gaussianity. 



4 www.darkenergysurvey.org 

http://sci.esa.int/euclid 
® www.Isst.org 
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FIG. 3: The effective bias, /3 (defined by Eq. I43p associated with objects of mass greater than M in a spherical volume of 
radius L — I00h~ Mpc. Compared with figure [2] the effect of non-Gaussianity on /3 is much more apparent. 



V. EXTREME- VALUE DISTRIBUTIONS 

In this section, we present the calculation of the distribution^ of extreme-mass clusters. The necessary ingredients 
are the non-Gaussian number density and real-space bias calculated in the previous sections. 

A. Distribution function 

White [21| derived the following expression for the cumulative probability that a region of volume V contains no 
object of mass M and above 



P{M) = exp 



i:t!fc^ n/^dx,Uf (..,.„, ^^x.) 



.k=l 



(46) 



where S^f =1, '^(> M) is given by (1141) and ^^ is the /c-point correlation function of density peaks in V associated 
with halos of mass > M. As in [22, l23|, we shall at times refer to ^ as a 'patch'. If we take the patch to be a sphere 
of comoving radius L, the volume- averaged correlation then simplifies to the cumulant (connected moment) smoothed 
by a top-hat window of radius L as follows 



n/,^Uf(x.^... 



•Xk) 



('5p\)c(L) 

(aP'^(L))2'=-2^f(L) 
i/3{L,M)aL)^''-'Sf{L) 



(47) 



where /? is given by 1421 The cumulants for density peaks have been calculated in the context of hierarchical structure 
formation with Gaussian initial condition [63, [H, |6^, llO] • In the presence of non-Gaussianity, however, the perturba- 
tion theory required to calculate the cumulants for density peaks becomes much more complex (see e.g. |70l [tH. [73| ) . 
To make analytic progress, we shall consider only fc up to 3 in the sum (|46p. The terms k = 1 and 2 correspond to 
well-known results previously found in [23l , |29| , namely 



nV+-inVl3aL)\ 



(48) 



''' We use the word 'distribution' in the strict sense, referring to the cumulative distribution and not the pdf. 



where we have used the reduced cumulant 5*2 = 1. Given a weakly non-Gaussian initial condition, the third cumulants 
for density peaks are expected to be dominated by nonlinear gravitational effects, since primordial non-vanishing 
cumulants decay at the rate Sk./D'^^^(z) [73| . Neglecting these effects, we can use the expression for the cumulant of 
the lognormal distribution [7J| 

Sf{L)=3 + al (49) 

which was found to be in fair agreement with A'^— body simulations of non-Gaussian models with |/nl| as large as 
1000 (at least in the quasi-linear regime with cr ~ 1) 44]. This approximation is sufficient for the range of cluster 
masses (> IO^'^H'^Mq) examined in this work. 

Collecting these results, we find the extreme-value distribution 

InP(Af) « -X + ^X^Y'^ -^X^Y^SfiL), 
where X = n{> M)V, Y = P{L,M)aL. (50) 

Setting (|50|) equal log (1/2) (i.e. the median value of M^ax) gives an estimate of the modal value of Mmax, at 
least in weakly non-Gaussian distributions (see [29| for the Gaussian case). However, the equation is non-linear in 
-^^max and the approximate Mmax dependences in these terms are not intuitive. Instead, we look for the peak in the 
derivative of (j50p . i.e. the probability density function. Nevertheless, the shape of the distribution function holds 
valuable statistical information to which we shall return when we consider the extremal-type distributions in Appendix 
B. 

B. PDF of extreme-mass objects 

We can obtain the probability density function (pdf) for the most massive objects expected in a volume by differ- 
entiating the distribution function (|50p with respect to M , noting that the only dependence on M is in the number 
density, n, and the bias, h. The result is 



P{M) = ^ = VP{M) 



"•^ I ^ , „TAo2_2 ^ „2jr2 oAA c IT\\ I „2t^o_2 "P /i 2„tao2_2 



— ( -1 + nVP'ai - -n'V'/3^alS,{L)\ + n'VPai^ ( 1 - -nV^'aiSsiL) 



m 



where dn/dM is given by ([20)) (note the subtlety that dn{> M)/dM = —dn/dM). Here, we see explicitly that the 
pdf of extreme-mass objects not only depends on the bias, but also on its mass variation, d/3/dM . 

C. /nl and extreme- value pdf 

The main results of this paper are shown in figure |H The panels show the probability density function (|5ip for 
the three mass functions at redshift z ~ 1, 1.6 and 3 (corresponding to the left, middle and right column) with 
/nl — 200, 100 and (top, middle and bottom row respectively). The survey volume is taken to be a sphere of radius 
100/i^^Mpc. To display the correct scaling on the horizontal log scale, we plot dP/d log Af^ax on the vertical axis 
whilst the actual value of the pdf is rfP/dMmax. From these graphs, we make the following observations: 

(a) Going from the bottom row to the top, we see that increasing /nl increases the height of the pdf whilst positively 
skewing it {i.e. lifting the positive tail). This has the effect of increasing the mass of the most probable extreme 
objects in a given volume. 

(b) Going from the first column to the third, we see that at higher redshifts, the pdfs are more peaked and the 
peaks are located at lower Mmax. 

(c) The Shcth-Tormcn mass function predicts the largest mass of extreme objects, followed by the Tinker and the 
Press-Schechter mass functions. This is a consequence of their predicted number densities as seen in figure [TJ 

(d) The differences between the mass-functions become much more pronounced at high redshifts. In the third col- 
umn, we see a clear separation of the peaks for different mass functions, with non-Gaussianity further enhancing 
the differences. 



10 



O 
ID 

T3 









1.5 



0.5 



- 


'j 


P-S 

Tinker 

/^ S-T 

'A'. fNL=200 - 



1.5 



0.5 




1 


P-S 

^ . Tinker 

X'- S-T 

;\\ fNL=200 - 

;■ \\ Z=1.6 



10' 





P-S 




Tinker 




S-T 


- 


fNL=100 - 


'"nA>\ 


z=1.6 


'' /.■'\ 


\ 


/ /■' 


\\ ■ 









10' 
Mmax {Ii-^Mq) 

FIG. 4: The probability density function of the maximum mass, A/max, of objects in a spherical volume of radius L = 100/i~ 
Mpc. In each panel, the mass functions used are Press- Schechter (dashed/magenta), Tinker (solid/blue) and Sheth-Tormen 
(dotted/red). Top row: f-^i^ = 200 with z = 1, 1.6 and 3 (from left to right). The pdf at the same redshifts are shown for 
/nl = 100 (middle row) and /nl ~ (bottom row). The non-Gaussian effects are most visible in the third column in which 
the peaks can be seen to move to higher Mmax with increasing /nl- 



(e) In figure[5l we show the effect of varying the patch radius, L from 100 to 500 /i^^Mpc (with z = 1 and /nl — 100) . 
By increasing L, the peak of the pdf shifts significantly to higher Mmax- The pdf also becomes more peaked 
with increasing L. This is simply due to the fact that as the sample size, L, approaches the population size, 
repeating the sampling will yield almost identical maxima in the samples. 

Finally, we investigate the relative importance of the three terms on the right-hand side of equation (|50p . We 
consider the extreme-value distributions in following cases 

(A) /nl = 100, L = lOO/i-^Mpc, z = 1, 

(B) /nl = 200, L = 500/i-iMpc, z = 3, 

(C) /nl == 0, L = 500/i-iMpc, z = 3. 

Figure [S] shows the resulting distributions when one, two or three terms on the right-hand side of ([50]) are taken into 
account (using the Tinker mass function). In case (A), we see that the Poisson approximation (keeping only the first 
term in (|50|) ) is fairly close to the 3-term result. Generally, this holds as long as the non-Gaussian effects are small 
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FIG. 6: The contributions of the 3 terms in (|50|) towards the extreme-value distributions, using 3 sets of parameters Left: (A) 
/nl = 100, L = 100/i^^Mpc, z^l. Middle: (B) /nl = 200, L = 500/i~^Mpc, z = 3. Right: (C) /nl = 0, L = 500/i"^Mpc, 
z = 3. In case (B), the Poisson approximation ('1-term') is clearly inaccurate. 



{i.e. at small volume, low redshift). However, in case (B), we see that the Poisson approximation underestim.ates the 
extreme cluster masses. In this case the peak of the pdf, though merely shifted by < 10%, is much narrower and rises 
to a higher maximum value (as can be estimated by eye from the slope around P = 0.5). In both cases, however, it 
is inconsistent to include the bias but ignore the third term (representing the skewness of the halo distribution in the 
sample). This is shown in the upturns of the 2-term distribution functions. In case (C), where /nl = 0, the 2nd and 
3rd correction terms are negligible and the Poisson approximation is very good indeed. 

Strictly speaking, the calculations here are valid only for L ^ r. In actual applications, we will be interested in 
the cases where L = 0{& few Gpc). In such cases, the redshift variation within the patch must be taken into account 
(as emphasised in [22|). This requires replacing the number density, bias and cumulants by their average within a 
comoving volume. We shall demonstrate this in the next section. 



VI. APPLICATIONS 



A. A massive cluster at z = 1.579 : a problem for ACDM? 



Santos et al.^ recently reported the discovery of a cluster XMMUJ0044. 0-2-33 (hereafter XMMUJ0044) at z = 
1.579, detected in the X-ray data of the XMM-Newton telescope and later followed up spectroscopically. The cluster 
mass was estimated to be ~ 3.5 — 5 x 10^^M (Ti, f ar greater than the previous X-ray cluster of mass 5.7x 10"'^^ at z = 1.62 



reported by Tanaka, Finoguenov and Ueda [15| . We shall now use extreme- value statistics to study the probability of 
finding XMMUJ0044 as the maximum-mass cluster. In particular, we ask, is the existence of XMMUJ0044 consistent 
with /nl = 0? 
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B. Eddington bias 

We take the mass of XMMUJ0044 to be** Afobs = (4.46 ± 0.79) x 10^'^Mq (=(3.12 ± 0.55) x lO^^h-^Mrs,) and ask: 
what is the probabiUty that this cluster is the most massive one in the redshift range 1.579 ^ z < 2.2? Here the 
redshift upper bound is consistent with the highest redshift probed by the XMM survey. 

The reported cluster mass must first be corrected for Eddington bias, which refers to the apparent boost in the 
cluster mass due to the fact that it is more likely for lower-mass objects to scatter to high luminosity than it is for 
rarer massive objects to scatter to lower luminosity. We account for this effect by the correction ^20,] 

In M = In Mohs + ^ 7<m , (52) 

where crinM ~ 0.3 is the error estimated from the observation and 7 is the local slope of the mass function determined 
using the relation dn/d\nM oc M'^, and therefore satisfies 

d'^n I dn , ,^ _„, 

l=-nhr^ " = lnM. (53) 

dw^ I du 

The final masses with Eddington-bias corrections are listed in Table ID The corrected mass depends on n(> M) and 
hence it also depends on the mass function used. There is also a weak dependence on /nl (entering via the MVJ 
correction (IIGI) ). With /nl = 100, the changes in the corrected masses are less than a percent and it is reasonable to 
neglect this correction as long as |/nl| ^ 0(10^). We quote the corrected mass with /nl = in TableHl 

C. Redshift averaging 

The patch of interest is now a spherical shell whose thickness is determined by the redshift band Az. To account 
for the redshift variation within the shell, we perform the following modifications to the variables X and Y in the 
distribution function (|50p . 

X = {n{> M))V = dz dm-^^. (54) 

Jaz Jm dm dz 

Y = Ahcii(Af)(a). (55) 

The bias averaged within the shell, /Sshoib is derived in Appendix A (Eq. IA17I) and is given by 

L^F{L))+e 



-'she 



L3_^3 



-(G(Af)). (56) 



Here L and £ are comoving lengths corresponding to redshifts 2.2 and 1.579 respectively, (a), {F{L)) and {G{M)) 
are the redshift averages of (fTO]) . (|44|) and (|45|) . We define the redshift average of a quantity, Q(z), by 



TABLE I: Observed and Eddington-corrected mass for the cluster XMMUJ0044, in units of lO^/i'^Mg 

Observed mass Eddington-corrected mass 

(10"/i"^Mq) Press-Schechter Sheth-Tormen Tinker 



3.12 ±0.55 2.48 ±0.40 2.62 ±0.43 2.56 ± 0.41 



The mass of XMMUJ0044 published in [ijl is (4.25 ±0.75) X IO^'^A/q, given with respect to the critical density. Assuming NFW cluster 
profile, Waizmann et a/. [30| showed that, with respect to the mean background density, the value increases to (4.46 ± 0.79) X IO^^Mq. 
We thank J.C. Waizmann for bringing this point to our attention. 
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FIG. 7: Left: The probability n (Eq. [58]) that XMMUJ0044 is the most massive cluster in 1.579 < z < 2.2 plotted as a 
function of sky coverage (in square degree). Using the Tinker mass function, the probability is maximised to ~ 50% when /gw 
is of order 50 deg^. Right: The extreme- value pdf with coverages (from left to right) 80 deg'^, 1000 deg'^ and full sky (using 
the Tinker mass function and /nl ~ 0). Note from Table[l]that the cluster lies at 2.56 ± OAlh'^ Mq (vertical dotted lines). 

D. Results 

We are interested in the probability that a cluster of mass in the range [M + a, M — a] is the maximum-mass object 
observed in a survey with a given /sky and redshift range. Denoting this probability as 11, we can express it as the 
difference in the distribution function (1501) evaluated at M zL a. 



n = P(M + cr) -P{M -a). 



(58) 



We take the mass range to be those in shown in Table HI In addition to the choice of mass function explored in the 
previous sections, here we consider three other factors that affect 11. 

1. Dependence on fsky 

The XDCP survey covers a sky area of approximately 80 deg^ [14|. However, the value of /sky appropriate for our 
calculation must take into account all previous surveys that have explored the redshift interval in other parts of the 
sky, regardless of whether a positive detection is reported. 

In figure [7] (left panel), we show 11 as a function of survey area in square degree. Here H is calculated using the 
Tinker mass function and /nl = 0. We see that the observation of an extreme object such as XMMUJ0044 is most 
likely in a survey area of around 50 deg^ (where H ~ 0.5). At wider coverages, we expect the most probable extreme 
mass to be larger. In fact, taking /sky = 1 as the most conservative limit, we find the most likely extreme object in this 
redshift range to be a cluster of mass ^ 7 — 8 x lO^'^h^^ Mq (figure[7l right panel), well above the Eddington-corrected 
mass of XMMUJ0044. On the other hand, taking /sky = 80 deg^, the most probable extreme- mass object is consistent 
with XMMUJ0044, as the peak of the extreme value pdf lies within the mass estimate (vertical contours in figure [7]). 

It is difficult to estimate what is the correct value of /sky is needed in such cases and, unfortunately, the question of 
whether non-Gaussianity is needed to explain the existence of certain clusters depends sensitively on the value of /sky 
assumed. Making the most conservative interpretation using /sky = 1 and assuming that there are good prospects 
for discovering many more massive high-redshift clusters in the future, we conclude that XMMUJ0044 presents no 
tension with ACDM (see also the conclusions of J20l. Isij). 



Dependence on f^L- 



In figure |8] (left panel) we show the effect of introducing /nl = ±150 to the extreme- value pdf. Here we take a 
fiducial value of /sky = 1, and use the Tinker mass function. As expected, the pdf is shifted left or right depending 
on the sign of /nl- 
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FIG. 8: The extreme-value pdf for objects in the range 1.579 < z < 2.2 with a fiducial value for /gj^y = 1, using the Tinker 
mass function. Left: /nl is varied from —150 to 150 with as = 0.801 Right: erg is varied in the range as = 0.801 ± 0.03 [g] 
with /nl ~ 0. The degeneracy is discussed in the text. 



3. De 



ence on as 



The panel on the right of figure H] shows the equivalent effect when as is varied in the range as — 0.801 ± 0.03 
(taken from the WMAP 7-year constraint Q), whilst /nl is fixed to 0. The shift of the pdf towards more massive 
extreme objects follows from the fact that a greater as introduces a larger spread in the mass range of cluster in the 
survey volume. 

The similarity between the two panels of figure [5] refiects the well-known degeneracy between /nl and as UM, UM ■ 
This degeneracy can be easily broken, for instance, by the constraint on the galaxy power spectrum (which is sensitive 
to as but not /nl) or the CMB temperature anisotropies |19l |. 

In summary, the degenerate effects between /sky , /nl and as shown here imply that it is very difficult to deduce 
information on non-Gaussianity from the extreme-value distributions alone. The most sensible approach is combine 
the results from several cluster surveys (to achieve fgky — 1) with probes of the CMB (to break the CTg degeneracy), 
assuming selection effects and error in the mass determination can be kept in check. 



E. The most massive object in the Universe 



As a consistency check, we plot the extreme-value pdf for an extended redshift range z > and /sky = 1 in figure 
m This gives the extreme-value pdf for the most massive object in the Universe. With as = 0.801, We find this to be 
an object of mass 2 — 5 x 10^^/i~^Mq, depending on the mass function. The result using the Tinker mass function 
is Mmax ~ 3.5 X l0^^h~^MQ which agrees broadly with those reported by [22,[23|- The effect of /nl = 100 increases 
this value by by less than 10%. 



F. Extremal types 



Extreme-value distributions obey a limit theorem analogous to the Central Limit Theorem. This is the so-called 
Extremal Types Theorem^ which roughly states that extreme-value distributions converge to one of three possible 
types in the large-sample limit (see Appendix B). This beautiful theorem has found applications in areas such as 
meteorology, engineering and finance, where a large volume of data allows extreme- value statistics to be modelled by 
fitting only one or two parameters of an extremal type (analogous to fitting the mean and variance of the normal 
distribution) @[7^. 

In cluster cosmology, the dearth of cluster data at present is not ideal for application of the limit theorem, although 
there have been attempts to apply it to simulated data (see [23, 1231 )■ The question of which extremal type extreme- 
mass clusters converge to remains unclear, although there is some evidence from simulations that /nl appears to 
play no role in the convergence (28J, |80| . We concur with this latter view and present the detail our investigation in 
Appendix B. 
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FIG. 9: The extreme-value pdf for the most massive object in the Universe (with /gj^y = 1 and /nl = 0) for various mass 
functions. 



VII. CONCLUSION AND DISCUSSION 

In summary, we have investigated quantitatively how the statistics of extreme-mass clusters is affected by uncer- 
tainties in the mass function, non-Gaussian corrections of the mass function and bias, Eddington correction, /sky, 
redshift, /nl and cr^. More specifically, 

1. We have presented a procedure to calculate the statistics of extreme-mass galaxy clusters in the presence of 
primordial non-Gaussianity parametrized by /nl- Our main results are the expressions for the cumulative 
probability distribution (|50|) and the probability density function ()51|) for the most massive object in a survey of 
a given sky coverage and redshift range. These expressions enable us to deduce the most probable extreme-mass 
cluster in a survey of a given specification. The effects of changing the mass function and varying the value of 
/nl, survey volume and redshift are summarised in figures H] and [SJ 

2. Our correction terms for the extreme- value distribution (second and third terms on the right-hand side of (j50p ) 
are significant when considering a large volume, high redshift or large non-Gaussianity (see figure 15]). For 
non-Gaussianity with /nl = 0{1), the first term of ((5(1)) (Poisson approximation) suffices. 

3. Next, we applied our formalism to investigate the extreme-value properties of cluster XMMUJ0044. 0-2-33 
(Mobs — 3.12 X IQ^^h^^MQ at z — 1.579). We find that the probability that the cluster is the most likely 
extreme-mass cluster in the survey depends sensitively on the assumed sky coverage, but is consistent with 
/nl — (assuming a^ — 0.801). More conservatively, with /sky = 1, the most probable extreme-mass cluster is 
expected to be much larger and perhaps this will be confirmed by future X-ray cluster surveys. 

4. We show that the effect of /nl in shifting the extreme-mass cluster to higher values is degenerate with an 
increase in erg (figure |S]). The degeneracy can be broken by combining cluster data with CMB constraints. 

An important ingredient in our calculation is the mass function. In the presence of primordial non-Gaussianity, 
it remains to be seen what the correct mass function should be. Our investigation showed that the Press-Schechter, 
Sheth-Tormen and Tinker mass functions give similar extreme-value statistics at low redshift, but there are large 
differences at high redshift and large /nl- The understanding of the correct form of the mass function appropriate 
for these extreme-mass objects is important since the uncertainty in the distribution of extreme-mass clusters due to 
the mass function is comparable with that from the mass determination (typically crinM ~ 0.3). Thus, it remains for 
further numerical simulations along the lines of |45l . |46l| to establish the validity of the various mass functions and 
non-Gaussian correction factors in the presence of non-Gaussianity. 

Another crucial ingredient is the bias which, in this work, was calculated using the real-space formalism given 
in [SJ, I33 combined with our averaging procedure outlined in Appendix A. As pointed out in these papers, it is 
possible to extend the calculation to other types of non-Gaussianity (non-local or higher-order local type). It will be 
an interesting extension to study extreme-value statistics in the presence of different types of non-Gaussianity. 
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Appendix A: Volume-averaged bias and its approximation 

Let b'^{M) be the real-space bias associated with objects of mass M with /nl = 0. In the text, we have seen that 
h is independent of r to a good approximation. In the presence of non-Gaussianity, the large-r behaviour of h{M, r) 
is given by 

h{M,r) w f{r)g{M), where (Al) 



/(r) = l + i^(z)/NL 



(A2) 



g{M) = fe(A/,/NL = 0), (A3) 

and K{z) is independent of r, M and /nl- This approximation allows the averaging (|1U)) - (PT|) to be performed 
separately on /(r) and g{M). 

Our goal is to perform the averaging (1401) within a given volume. In the analysis of clusters lying within a redshift 
range Az, the associated volume is a spherical shell whose thickness is proportional to Az. Whilst the integration 
(|40p could, in principle, be evaluated using a 6-dimensional Monte-Carlo integration, in this Appendix we show how 
(|40|l could be reduced to a triple integral. The results presented here are clearly applicable to other fields in which 
volume averages are required. 

To begin, let us first consider an integral of the form 

/(Xi)= /dx2/(|xi-X2|), (A4) 

Jv 
where xi is a fixed vector and V^ is a sphere. 

1. Integration within a given sphere 

If xi lies inside a given sphere of radius L, we rotate the coordinate axes so that Xi lies along with the z-axis. 
We then translate the origin to the tip of xi . Let the spherical coordinates centred on this new origin be given by 
(u, 9, (p). One can show that the equation of the surface of the sphere is given by 



u 



^ -xicosO + J L^ - xlsiii^ 9, xi = |xi|. (A5) 



Hence, the integral (jA4p can be written as 



r2TT 



r PIT pU pZ 

/ dx2 /(|xi-X2|) = / siTi9d0 u^du d(j) f(u,e,(j)), (A6) 

Jv Jo Jq Jo 

r-l rU 



= 2tt I dfi duu^f{u), (A7) 

-1 Jo 



where the integration limit is given by 



U = -Xl^l+JL■^^xl{l~^J?). (A8) 



2. Integration outside a given sphere 

If xi lies outside a given sphere of radius d, the same transformation gives 
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/ dx2 /(|xi - X2I) = 27r / dill du u'fiu), (A9) 

where 

w± = -xi// ± -Jl^^^xlil^^). (AlO) 

3. Averaging within a shell 

Denote r = |xi — X2I. The net contribution of /(r) within a spherical shell of inner radius, £, and outer radius, L, 
is obtained by subtracting (jA9p from (|A7p . Finally, the average of / over the entire shell is obtained by integrating 
Xi over the sphere, and then dividing by the volume of the sphere. 

/.hell - J- / dxi / dx2 /(r), (All) 

•^shcU JV Jv 



47r '^ 



•^sholl 



/ x\dxi[^Ml - ^M] ^ (A12) 

Jt 



where 



Aiv 
T^shell = y(i'-^'). (A13) 

4. Applications to the bias 

Let us first apply (|A12p to calculate the bias averaged within a sphere radius L (eqs. (|43|) - (|45|) ). Substituting H — Q 
and /(r) as in (|A2[) we find 



6 

/sphere = 1 + -K{z)f^i^ 
o 



L -^ 



(A14) 



ft.-iMpc_ 
Therefore, the bias averaged over V for mass > M (Eq. B^ becomes 

/3(L,M)«/,phereG(M), (A15) 

where 

G{M) = \ b{m, /nl =0)-^ dm. (A16) 

n(> Ai) J ]^i dm 



More generally, with nonzero i, we find 



, £^/sphereW+^^ . , ^ „. 

•'-hell - L3-£3 ■ ^^^^) 



Note that by setting ^ = 0, we recover /sphere- To include the averaging over redshift, one performs the replacement 

/shell ^- 77 / dz fshcll-^- (A18) 

V Jaz dz 

Appendix B: Extremal Types 

The shape of the extreme-value distribution function holds valuable information about the statistical nature of 
galaxy clusters. The following theorem, which roughly states that extreme-value distributions converge to one of only 
three possible types, lies at the heart of extreme- value theory. 



18 



D) 




M, 



>14u-1 



/10'-h-'Ms,n 



FIG. 10: The quantile plots for the distributions of extreme-mass clusters for the three cases: (A) /nl = 100, 
L = 100/i"^Mpc, z = l, (B) /nl = 200, L = 500ft"^Mpc, 2 = 3, and (C) /nl = 0, L = 500/i"^Mpc, 2 = 3. The Tinker mass 
function was used. The concavity of these curves suggests that they belong to the Frechet class of distribution though they 
approach the Gumbel distribution at the high-mass end. This behaviour is insensitive to changes in all other parameters. 

Theorem (Extremal Types Theorem). Let Xi be independent and identically distributed random variables. Define 
the block maximum as Mn = maxi<i<„{Xi}. //, for some constants an > 0, bn, we have 

P{anMn + bn < x) — > G{x) as n — > oo, 

(in other words, if the distribution of rescaled maxima converges to a distribution G for large sample size), then G is 
one of the following distributions: 
L Gumbel type, G{x) — exp(— e^^) 

//. Frechet type, G{x) = < ' ~_ 

\exp{~y " 



in. Weibull type, G{x) = 



X > b 
exp(-(-2/)"), X <b 



1, X > b 
where y = ax + b, a,b,a are constants, a > and a > 0. 

See, for example, [8l| for the proof. In this Appendix, we investigate which of these extremal types do the 
distributions of extreme-mass clusters belong to. 

The following function will be useful in distinguishing between the three cases: 



g{x) = -ln{-lnP{x)). 



(Bl) 



In the case of the Gumbel distribution, g{x) — P^^jx:) — inf{y : P{y) > x}, which means that g{x) is the x-quantile 
of P. We shall refer to g{x) as the quantile function |79l.l83|. 

To see which extremal type a given extreme- value distribution, P{x), belongs to, one simply plots the quantile 
function and analyse its curvature for increasing patch size L. If the quantile is a linear, the distribution is of Gumbel 
type. If it concaves up {i.e. g"{x) > 0), the distribution is of Weibull type. If the quantile concaves down, it is of 
Frechet type. Note that the quantiles must be plotted on linear and not logarithmic scales. 

Figure [TOl shows the quantile plot, (/(Mmax), of the distribution of extreme-mass clusters, P(M,„ax), with /nl in 
the range — 200. The parameters for each curve are those listed in the cases (A), (B) and (C) in section FV C\ 
and the Tinker mass function is used. The concavity of these graphs clearly shows that the distribution of extreme- 
mass clusters are of the Frechet type, although the tails of the quantile graphs show an almost linear {i.e. Gumbel) 
behaviour. 

The Frechet distribution^ arises in situations when there is a natural lower limit in the distribution function 
{P{x) = for X < some constant). In our case, the definition of a galaxy cluster {e.g. via M200) translates to a 



Some applications of the Frechet distribution to environmental sciences are summarised in [781 [7! 
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loose lower bound on Mmax and this may explain why the distribution of extreme-mass clusters appears to be of 
the Frechet type. If only the high- mass tail of the distribution is taken into account, the Gumbel distribution is a 
reasonable approximation. As pointed out in [2J| , if the underlying distribution is exactly Gaussian, the distribution 
can be shown to converge to the Gumbel type, albeit very slowly. In any case, we find that -P(M,„ax), for all practical 
purposes, belongs to the Frechet family. 

This conclusion is remarkably robust against changes in /nl, mass function, redshift and patch size. It may be 
possible that this insensitivity stems from the truncation of the series (j50p . A more thorough approach to studying 
the extremal-type convergence is to fit the distribution to some functional form {e.g. see [28, 80] in which the extreme- 
value distributions are modelled as a generalised extreme- value distribution) or prove the convergence using one of 
the criteria given, for example, in [7^, |81|. Like (28|. p0ll. we find the convergence insensitive to the value of /nl- 



We note that, contrary to the observation in [23|, we found no combination of parameters which give rise to a 
Weibull distribution, which arises when there is a natural upper bound for the distribution function. Moreover, it is 
worth noting that if the pdfs such as those in figures |4] and [5] are well-approximated by 'skew-symmetric' functions 
{e.g. an Edgeworth expansion) then the distribution cannot converge to the Weibull type as proven in [83|. 

Finally, we point out an interesting fact that if the coefhcients in the expansion (I46p conspire to make P{x) an 
exactly Poissonian distribution 

P(x,A)=e-^^^, (B2) 

then the limiting distribution G{x) will completely degenerate to G = 1 or 0. This is one of the rare examples where 
the extreme-value distribution does not converge to any of the three standard distributions. Of course, we do not 
expect a realistic distribution of galaxy clusters to be exactly Poissonian. 
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